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Abstract — We investigate optimal routing and scheduling 
strategies for multi-hop wireless networks with rateless codes. 
Rateless codes allow each node of the network to accumulate 
mutual information from every packet transmission. This en- 
ables a significant performance gain over conventional shortest 
path routing. Further, it outperforms cooperative communication 
techniques that are based on energy accumulation. However, 
it requires complex and combinatorial networking decisions 
concerning which nodes participate in transmission, and which 
decode ordering to use. We formulate three problems of interest 
in this setting: (i) minimum delay routing, (ii) minimum energy 
routing subject to delay constraint, and (iii) minimum delay 
broadcast. All of these are hard combinatorial optimization prob- 
lems and we make use of several structural properties of their 
optimal solutions to simplify the problems and derive optimal 
greedy algorithms. Although the reduced problems still have 
exponential complexity, unlike prior works on such problems, 
our greedy algorithms are simple to use and do not require 
solving any linear programs. Further, using the insight obtained 
from the optimal solution to a line network, we propose two 
simple heuristics that can be implemented in polynomial time 
and in a distributed fashion and compare them with the optimal 
solution. Simulations suggest that both heuristics perform very 
close to the optimal solution over random network topologies. 

Index Terms — Mutual Information Accumulation, Rateless 
Codes, Minimum Delay Routing, Minimum Energy Routing, 
Minimum Delay Broadcast 



I. Introduction 

Cooperative communication promises significant gains in 
the performance of wireless networks over traditional tech- 
niques that treat the network as comprised of point-to-point 
links. Cooperative communication protocols exploit the broad- 
cast nature of wireless transmissions and offer spatial diver- 
sity gains by making use of multiple relays for cooperative 
transmissions. This can increase the reliability and reduce the 
energy cost of data transmissions in wireless networks. See 
[1J for a recent comprehensive survey. 

Most prior work in the area of cooperative communication 
has investigated physical layer techniques such as orthogonal 
repetition coding/signaling [2], distributed beamforming Oil , 
distributed space-time codes [4], etc. All these techniques 
perform energy accumulation from multiple transmissions to 
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decode a packet. In energy accumulation, a receiver can 
decode a packet when the total received energy from mul- 
tiple transmissions of that packet exceeds a certain threshold. 
An alternate approach of recent interest is based on mutual 
information accumulation 0. In this approach, a node 
accumulates mutual information for a packet from multiple 
transmissions until it can be decoded successfully. This is 
shown to outperform energy accumulation based schemes, 
particularly in the high SNR regime, in . Such a scheme 
can be implemented in practice using rateless codes of which 
Fountain and Raptor codes ID-El are two examples. Rateless 
codes encode information bits into potentially infinite-length 
codewords. Subsequently, additional parity bits are sent by the 
transmitter until the receiver is able to decode. The decoding 
procedure at the receiver is carried out using belief propagation 
algorithms. For an overview of the working of Fountain, LT, 
and Raptor codes, we refer to the excellent survey in ifTOl . 

In addition to allowing mutual information accumulation, 
rateless codes provide further advantages over traditional fixed 
rate schemes in the context of fading relay networks as 
discussed in fiTIl fT2ll . Unlike fixed rate code schemes in 
which knowledge of the current channel state information 
(CSI) is required at the transmitters, rateless codes adapt to 
the channel conditions without requiring CSI. This advantage 
becomes even more important in large networks where the cost 
of CSI acquisition grows exponentially with the network size. 
However, this introduces deep memory in the system because 
mutual information accumulated from potentially multiple 
transmissions in the past can be used to decode a packet. 

In this paper, we study three problems on optimal routing 
and scheduling over a multi-hop wireless network using mu- 
tual information accumulation. Specifically, we first consider 
a network with a single source-destination pair and n relay 
nodes. When a node transmits, the other nodes accumulate 
mutual information at a rate that depends on their incoming 
link capacity. All nodes operate under bandwidth and energy 
constraints as described in detail in Section HI We consider 
three problems in this setting. In the first problem, the transmit 
power levels of the nodes are fixed and the objective is 
to transmit a packet from the source to the destination in 
minimum delay (Section HIT1) . In the second problem, the 
transmit power levels are variable and the objective is to 
minimize the sum total energy to deliver a packet to the 
destination subject to a delay constraint (Section HvT) . In the 
third problem, we consider the network model with fixed 
transmit power levels (similar to the first problem) and with a 
single source where the objective is to broadcast a packet to 
all the other nodes in minimum delay (Section [V]). All of these 
objectives are important in a variety of networking scenarios. 
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Related problems of optimal routing in wireless networks 
with multi-receiver diversity have been studied in fT3l - |[T6l 
while problems of optimal cooperative diversity routing and 
broadcasting are treated in [17]-|20] and references therein. 
Although these formulations incorporate the broadcast nature 
of wireless transmissions, they assume that the outcome of 
each transmission is a binary success/failure. Further, any 
packet that cannot be successfully decoded in one transmission 
is discarded. This is significantly different from the scenario 
considered in this paper where nodes can accumulate partial 
information about a packet from different transmissions over 
time. This can be thought of as networking with "soft" 
information. 

Prior work on accumulating partial information from multi- 
ple transmissions includes the work in (6J, |[2Tll-[[26l. Specifi- 
cally, [21] considers the problem of minimum energy unicast 
routing in wireless networks with energy accumulation and 
shows that it is an NP-complete problem. Similar results are 
obtained for the problem of minimum energy accumulative 
broadcast in l22l -l'241. A related problem of accumulative 
multicast is studied in l25l . Minimum energy unicast routing 
with energy accumulation only at the destination is considered 
in l26ll . The work closest to ours is (6) which treats the 
minimum delay routing problem with mutual information 
accumulation. Both [23 ] |6] develop an LP based formulation 
for their respective problems that involves solving a linear 
program for every possible ordering of relay nodes over all 
subsets of relay nodes to derive the optimal solution. Thus, 
for a network with n relay nodes, this exhaustive approach 
requires solving J2m=i (™) m ' > n ' linear programs. 

The primary challenge associated with solving the problems 
addressed in this paper is their inherent combinatorial nature. 
Unlike traditional shortest path routing problems, the cost 
of routing with mutual information accumulation depends 
not only on the set of nodes in the routing path, but also 
their relative ordering in the transmission sequence, making 
standard shortest path algorithms inapplicable. Therefore, we 
approach the problem differently. To derive the optimal trans- 
mission strategy for the first problem, we first formulate an 
optimization problem in Section IIII-BI that optimizes over all 
possible transmission orderings over all subsets of relay nodes 
(similar to ll23l El). This approach clearly has a very high 
complexity of 0(n\). Then in Section IBl-Cl we prove a key 
structural property of the optimal solution that allows us to 
simplify the problem and derive a simple greedy algorithm 
that only needs to optimize over all subsets of nodes. Further, 
it does not require solving any linear programs. Thus, it has 
a complexity of 0(2 n ). We derive a greedy algorithm of 
the same complexity for the second problem in Section [IVl 
We note that this complexity, while still exponential, is a 
significant improvement over solving 0(n\) linear programs. 
For example, with n = 10, this requires 2 10 = 1024 runs of a 
simple greedy algorithm as compared to 10! = 3628800 runs 
of an LP solver. Note that for small networks, (say, n < 10), 
it is reasonable to use our algorithm to exactly compute the 
optimal solution. Further, for larger n it provides a feasible 
way to compute the optimal solution as a benchmark when 
comparing against simpler heuristics. 




Fig. 1. Example network with source, destination and 4 relay nodes. When 
a node transmits, every other node that has not yet decoded the packet 
accumulates mutual information at a rate given by the capacity of the link 
between the transmitter and that node. 

For the minimum delay broadcast problem, we identify a 
similar structural property of the optimal solution in Section 
IVl that allows us to simplify the problem and derive a simple 
greedy algorithm. While this greedy algorithm still has a 
complexity of 0(n!), it does not require solving any linear 
programs and thus improves over the result in |23ll that 
requires solving n! linear programs. In general, we expect 
all these problems to be NP-complete based on the results in 
ll2Tll - ll24ll . For the special case of a line topology, we derive 
the exact optimal solution in Section IIII-D1 Finally, in Section 
[VTI we propose two simple heuristics that can be implemented 
in polynomial time and in a distributed fashion and compare 
them with the optimal solution. Simulations suggest that both 
heuristics perform quite close to the optimal solution over 
random network topologies. 

Before proceeding, we note that the techniques we apply 
to get these structural results can also be applied to similar 
problems that use energy accumulation instead of mutual 
information accumulation. 

II. Network Model 

The network model consists of a source s, destination d 
and n relays r±, r2, • • • , r n as shown in Fig. [T] There are no 
time variations in the channel states. This models the scenario 
where the coherence time of the channels is larger than any 
considered transmission time of the encoded bits. In the first 
two problems, the source has a packet to be delivered to the 
destination. In the third problem, the source packet must to 
delivered to all nodes in the network. 

Each node i transmits at a fixed power spectral density 
(PSD) Pi (in units of joules/sec/Hz) that is uniform across 
its transmission band. However, the transmission duration for 
a node is variable and is a design parameter. The total available 
bandwidth is If Hz. A node can transmit the packet only if 
it has fully decoded the packet. For this, it must accumulate 
at least Imax bits of total mutual information. 

All transmissions happen on orthogonal channels in time 
or frequency and at most one node can transmit over a fre- 
quency channel at any given time. The channel gain between 
nodes i and j is given by hij which is independent of the 
frequency channel. However, this is not necessarily known at 
the transmitting nodes. 
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Fig. 2. Example timeslot and transmission structure. In each stage, nodes 
that have already decoded the full packet transmit on orthogonal channels in 
time. 

Under this assumption, the minimum transmission time 
under the two orthogonal schemes (where nodes transmit in 
orthogonal time vs. frequency channels) is the same. In the 
following, we will focus on the case where transmissions are 
orthogonal in time. When a node i transmits, every other 
node j that does not have the full packet yet, receives mutual 
information at a rate that depends on the transmission capacity 
Cij (in units of bits/sec/Hz) of link i — j. This transmission 
capacity itself depends on the transmit power Pi and channel 

channel, using 

haPi~ 



N 



strength hij. For example, for an AWGN 
Shannon's formula, this is given by = log 2 1 
where No/2 is the PSD of the noise process. Specifically, if 
node i transmits for duration A over bandwidth W, then node 
j accumulates AWCij bits of information. In the following, 
we assume W = 1 for simplicity. We assume that nodes use 
independently generated ideal rateless codes so that the mutual 
information collected by a node from different transmissions 
add up. We can incorporate the non-idealities of practical 
rateless codes by multiplying dj with a factor 1/(1 + e) where 
e > is the overhead. A similar model has been considered 
in 0. 

III. Minimum Delay Routing 

Under the modeling assumptions discussed in Section HH the 
problem of routing a packet from the source to the destination 
with minimum delay consists of the following sub-problems: 

• First, which subset of relay nodes should take part in 
forwarding the packet? 

• Second, in what order should these nodes transmit? 

• And third, what should be the transmission durations for 
these nodes? 

We next discuss the transmission structure of a general policy 
under this model. 



A. Timeslot and Transmission Structure 

Consider any transmission strategy Q for routing the packet 
to the destination in the model described above. This includes 
the choice of the relay set, the transmission order for this set, 
and the transmission durations for each node in this set. Let 1Z 
denote the subset of relay nodes that take part in the routing 
process under strategy Q. By this, we mean that each node 
in 1Z is able to decode the packet before the destination and 
then transmits for a non-zero duration. There could be other 
nodes that are able to decode the packet before the destination, 



but these do not take part in the forwarding process and are 
therefore not included in the set 1Z. 

Let k — \1Z\ be the size of this set. Also, let O be the 
ordering of nodes in 1Z that describes the sequence in which 
nodes in 1Z successfully decode the packet under strategy Q. 
Without loss of generality, let the relay nodes in the ordering O 
be indexed as 1, 2, 3, . . . , k. Also, let the source s be indexed 
as and the destination d be indexed as k + 1. Initially, 
only the source has the packet. Let to be the time when it 
starts its transmission and let ti, £2, • • • , tk denote the times 
when relays 1, 2, . . . , k in the ordering O accumulate enough 
mutual information to decode the packet. Also, let tk+i be the 
time when the destination decodes the packet. By definition, 

< h < h < • • • < tk < tfc+i. We say that the transmission 
occurs over k + 1 stages, where stage j, j G {0, 1, 2, . . . , k} 
represents the interval [tj,tj+i\. The state of the network at 
any time is given by the set of nodes that have the full packet 
and the mutual information accumulated so far at all the other 
nodes. Note that in any stage j, the first j nodes in the ordering 
O and the source have the fully decoded packet. Thus, any 
subset of these nodes (including potentially all of them) may 
transmit during this stage. Then the time- slot structure for the 
transmissions can be depicted as in Fig. [2] Note that in each 
stage, the set of relays that have successfully decoded the 
packet increases by one (we ignore those relays that are not 
part of the set 1Z). 

We are now ready to formulate the problem of minimum 
delay routing with mutual information accumulation. 

B. Problem Formulation 

For each j, define the duration of stage j as Aj = tj+i—tj. 
Also, let Aij denote the transmission duration for node i in 
stage j under strategy Q. Note that = if i > j, else A^ > 
0. This is because node i does not have the full packet until 
the end of stage i — 1. The total time to deliver the packet to 
the destination T to t is given by T tot = t k+1 -t = ^=0 Aj. 
For any transmission strategy Q that uses the subset of relay 
nodes 1Z with an ordering O, the minimum delay is given by 
the solution to the following optimization problem: 

k 

Minimize: T to t — Aj 
3=0 

m — 1 m — l 

Subject to: ^ ^ A^dm > I max Vra G {1, 2, . . . , k + 1} 

2=0 j=0 
3 

Y^Aij < Aj Vje {0,1,2,...,*;} 

2=0 

A^ > Vi G {0, 1,2,..., k}J G {0, 1, 2, ... , k} 
A^ = Vz > j 

Aj > V j G {0, 1, 2, . . . , (1) 

Here, the first constraint captures the requirement that node 
m in the ordering must accumulate at least I m ax amount 
of mutual information by the end of stage m — l using all 
transmissions in all stages up to stage m — l. The second 
constraint means that in every stage j, the total transmission 
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Fig. 3. Optimal timeslot and transmission structure. In each stage, only the 
node that decodes the packet at the beginning of that stage transmits. 

time for all nodes that have the fully decoded packet in that 
stage cannot exceed the length of that stage. 

It can be seen that the above problem is a linear program 
and thus can be solved efficiently for a given relay set 1Z 
and its ordering O. Indeed, this is the approach taken in O 
that proposes solving such a linear program for every possible 
ordering of relays for each subset of the set of relay nodes. 
While such an approach is guaranteed to find the optimal 
solution, it has a huge computational complexity of 0(n!) 
linear programs. In the next section, we show that the above 
computation can be simplified significantly by making use of 
a structural property of the optimal solution. 

C. Characterizing the Optimal Solution of ([7]) 

Let lZ op t denote the subset of relay nodes that take part in 
the routing process in the optimal solution. Let k = \1Z opt \ 
be the size of this set. Also, let O opt be the optimal ordering. 
Note that, by definition, each node in !Z opt transmits for a 
non-zero duration (else, we can remove it from the set without 
affecting the minimum total transmission time). Then, we have 
the following: 

Theorem 1: Under the optimal solution to the minimum 
delay routing problem (Q]), in each stage j, it is optimal for 
only one node to transmit, and that node is node j. 

Fig. [3] shows the timeslot structure under the optimal solu- 
tion. The above theorem shows that only one node transmits 
in each stage, and that the optimal transmission ordering is 
the same as the ordering that nodes in the set !Z opt decode 
the packet. Comparing this with the general timeslot structure 
in Fig. [2 it can be seen that Theorem [T] simplifies problem 
(Q} significantly. Specifically, Theorem \T\ implies that, given 
the optimal relay set 7Z op t, the optimal transmission structure 
(i.e., the decoding order and the transmission durations) can 
be computed in a greedy fashion as follows. First, the source 
starts to transmit and continues to do so until any relay node in 
this set gets the packet. Once this relay node gets the packet, 
we know from Theorem [T] that the source does not transmit in 
any of the remaining stages. This node then starts to transmit 
until another node in the set gets the packet. This process 
continues until the destination is able to decode the packet. 
The optimal solution to (OD) can then be obtained by applying 
this greedy transmission strategy to all subsets of relay nodes 
and picking one that yields the minimum delayQ Note that 

! We note that the transmission structure characterized by Theorem [T] is 
similar to the wavepath property shown in 1211 for the problem of minimum 
energy unicast routing with energy accumulation in wireless networks. How- 
ever, our proof technique is significantly different. 



applying this greedy transmission strategy does not require 
solving an LP. While searching over all subsets still has an 
exponential complexity of 0(2 n ), it can be used to compute 
the optimal solution as a benchmark. 

Theorem Q] also implies that multiple copies of the packet 
need not be maintained across the network. For example, note 
that the source need not transmit after the first relay has 
decoded the packet and therefore can drop the packet from 
its queue. 

We emphasize that the optimal transmission structure sug- 
gested by Theorem Q] is not obvious. For example, at the 
beginning of any stage, the newest addition to the set of relay 
nodes with the full packet may not have the best links (in terms 
of transmission capacity) to all the remaining nodes, including 
the destination. This would suggest that under the optimal 
solution, in general in each stage, nodes with the full packet 
should take turns transmitting the packet. However, Theorem 
[T] states that such time-sharing is not required. 

Before proceeding, we present a preliminary lemma that is 
used in the proof of Theorem [T] Consider any linear program: 

Minimize: c T x 
Subject to: Ax = b 

x>0 (2) 

where x G R n . Then we have the following: 

Lemma 1: Let x* be an optimal solution to the problem © 
such that x* > (where the inequality is taken entry wise). 
Then x* is still an optimal solution when the constraint x > 
is removed. 

Lemma Q] implies that removing an inactive constraint does 
not affect the optimal solution of the linear program. This is 
a simple fact and its proof is provided for completeness in 
Appendix A. 

Proof: (Theorem\Q: Note that Theorem [T] trivially holds 
in stage (since only the source has the full packet in this 
stage). Next, it is easy to see that in the last stage (i.e., stage 
k), only the node with the best link (in terms of transmission 
capacity) to the destination in the set lZ opt should transmit in 
order to minimize the total delay. This is because this node 
will take the smallest time to transmit the remaining amount of 
mutual information needed by d to decode the packet. Further, 
we claim that this node must be the node k in the ordering 
Oopt- This can be argued as follows. Assume that the node 
with the best link to the destination in the set !Z opt has the 
full packet at some stage (k — j) (where < j < k) before 
the start of stage k. Then a smaller delay can be achieved by 
having only this node transmit after it has decoded the full 
packet from that stage onwards. Thus, the other nodes labeled 
k — j, . . . , k — 1 in the transmission order do not transmit, a 
contradiction. This shows that under the optimal solution, in 
the last stage k, only node k in the ordering O opt transmits. 
Using induction, we now show that in every prior stage (k — j) 
where 1 < j < k — 1, only one node needs to transmit and 
that this node must be node k — j in the ordering O opt . 

Consider the (k — l) th stage. At time tk-i, all nodes 
except k and d have decoded the packet. Let the mutual 
information state at nodes k and d at time th-\ be Ik(tk-i) 
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and Id(tk-i) respectively. Also, suppose in the (fc — l) th stage, 
relay nodes 1, 2, . . . , k — 1 and the source transmit a fraction 
c^i -1 , a* -1 , • • • , (*k-i and a o _1 of the total duration of stage 
(fc — 1), i.e., Afc_i respectively. Note that these fractions must 
add to 1 since it is suboptimal to have any idle time (where 
no one is transmitting). Then, the optimal solution must solve 
the following optimization problem: 



eliminate A k from the objective and ignoring the constant 
terms, © can be expressed as: 



fc-i 



Minimize: ^ Pi(C kd - C id ) 



k-l 



Subject to: h{t k -i) + @ iCik = Irt 



Minimize: A k _i + A* 



i=0 



k-l 



Subject to: I k (t k -i) + A fc _i ^ o£ 1 C ik > I n 

i=0 
k-l 



ft >0 Vie {0,1,2,. ..,£-1} (5) 

This optimization problem is linear in ft with a single linear 
equality constraint and thus the solution is of the form where 



all except one ft are zero. Since a i 



k-l _ _JU 



-, we have 



Id(tk-i) + Afc_i ^ Cid + A k C k d > Imax that in the optimal solution, exactly one of the fractions 



i=0 

U < a , a x , . . . , < 1 

k-i 

i=0 

A fc _! >0,A fe >0 



(3) 



Here, the first constraint states that relay k must accumulate 
at least Imax bits of mutual information by the end of stage 
(k — 1). The second constraint states that the destination must 
accumulate at least Imax bits of mutual information by the 
end of stage k. Note that in the last term of the left hand side 
of the second constraint, we have used the fact that only node 
k transmits during stage k. 

It is easy to see that under the optimal solution, the first 
and second constraints must be met with equality. This simply 
follows from the definition of the beginning of any stage j 
as the time when node j has just decoded the packet. Next, 



let ft 



_1 for all i £ {0, 1, 2, . . . , k - 1}. Since 



Yli=o a i 1 = 1, we have that J2i=o ft = ^k-i and © is 
equivalent to: 



k-l 



Minimize: ft + A& 



2=0 



k-l 



Subject to: h(t k -i) + ^ftC i/c = I n 



i=0 
k-l 



Idifk-l) + ^ PiCid + A k C k d — Imax 
i=0 

A k > 0, ft > Vi G {0,1,2, 1} (4) 

Note that problems © and © are equivalent because we can 
transform © to the original problem by using the relations 
A/c-i = Yli=o Pi anc * c^ -1 = a~~~7- The degenerate case 
where A^-i =0 does not arise because if A^_i =0, then no 
node transmits in stage (k — 1) and we transition to stage k 
in which only node k transmits. This means node k — 1 never 
transmits, contradicting the fact that it is part of the optimal 
transmission schedule. 

Since we know that under the optimal solution, A& > 0, we 
can remove the constraint A k > from © without affecting 
the optimal solution (using Lemma []])• Next we multiply 
the minimization objective in © by C k d without changing 
the problem. Then, using the second equality constraint to 



c^q -1 , . . . , a^-i * s ec l ua l to 1 and rest must be 0. This 

implies that only one node transmits in this stage. Further, this 
node must be the relay node k — 1 that decoded the packet at 
the beginning of this stage. Else, node k — 1 never transmits. 
This is because by definition of stage (k — 1), node k — 1 does 
not have the packet before the beginning of stage (k — 1) and 
hence cannot transmit before stage {k — 1). Since only node 
k transmits when stage (k — 1) ends, if node k — 1 is not the 
node chosen for stage {k — 1), it never transmits, contradicting 
the fact that it is part of the optimal set. H 

Now consider the (k — j) th stage and suppose Theorem Q] 
holds for all stages after stage (k — j) where 2 < j < k — 1. 
This means that in every stage after stage (k — j), only the 
node that has just decoded the packet transmits. At time t k -j, 
all nodes except k—j + l,k—j + 2,...,k and d have decoded 
the packet. Let the mutual information state at these nodes at 
time t k -j be I k -j +1 (t k -j), I k -j+ 2 (t k -j), • • • , I k (t k -j) and 
Id(t k -j), respectively. Also, suppose in the (k — j) th stage, 
the source and the relay nodes 1, 2, . . . , k— j transmit a fraction 



k—j k—j k—j 



, ol^Ja of the total duration of stage (k- 



j), i.e., A k -j respectively. Then, the optimal solution must 
solve the following optimization problem: 



Minimize: A k -j+ Ti 



k-j 

Subject to: I k _ j+1 (t k -j) + A fc -j [ ^ a^~ J C ijk - j+1 

2=0 

k-j 

Ik-j+nifk-j) + A/e_j Cj\ ° Ci^ k -j+ n 



k-j, 



i=0 



n-1 



^ A k -j+iC k -j+i ik -j+ n = Imax Vn G {2, . . . , j + 1} 



fc-j 



0<a J ,...,a /e _j < l,2^«i -1 



z=0 



Afc-j >0,A fc - i+ i >0,...,A fc >0 



(6) 



where the first constraint states that relay k — j + 1 must 
accumulate i ma x bits of mutual information by the end of 

2 This is a crucial property that holds only for the unicast routing case. As 
we will see in Section [V] this does not necessarily hold for the minimum 
delay broadcast problem. 
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X^=0 PiCi,k-j + 2 

J2i=o PiCi,d 



+ 



Ck-j+l,k-j+2 
Ck-j+l,k-j+3 



Ck- 



3 + l,d 



1 





a 



k,d 



Imax Ik— i+2 (fk — j ) 
Imax Ik—j-\s{tk — j) 

I max Idifk—j) 



TABLE I 

Second set of constraints in (Q in Matrix form. 



stage (fc — j). The second set of constraints state that every 
subsequent node k—j + n (where 2<n<j + l) including the 
destination in the ordering O op t must accumulate Imax bits of 
mutual information by the end of stage (k — j + n). In the last 
term of the left hand side of each such constraint, we have used 
the induction hypothesis that in every stage after stage (k — j), 
only the node that just decoded the packet transmits. Using the 
transform ft = A k -ja^~^ for all i G {0, 1,2,..., k — j}, and 
Yli=o a i~^ = 1, we have the equivalent problem: 

k-j 

Minimize: ^ ft + &k-j+i + . . . + A k -i + A k 

i=0 

k-j 

Subject to: I k -j +1 (t k -j) + ^ Pid^k-j+l = Imax 

i=0 

k-j 

Ik-j+n(tk-j) + y ] ftCj,fc-j+n 

2=0 

n-1 

+ A k -j+iC k -j+i jk -j+ Vn€{2,...,j + 1} 

i=l 

ft > Vi e {0, 1, 2, . . . , fc - j}, A fc _ j+ i > 0, . . . , A k > 

(7) 

The problems © and © are equivalent because we can 
transform d7]) to the original problem by using the relations 
Afc-j = A anc * = Afc* • • ^ ne degenerate case 

where A k -j =0 does not arise because if A k - 3 ; = 0, then no 
node transmits in stage (fc — j). We know from the induction 
hypothesis that only the nodes after node k — j in the ordering 
O opt transmit after stage (k — j). This means that node k—j 
never transmits, a contradiction. 

The second set of constraints in problem (|7]) can be written 
in matrix form as B + CA = I as shown in Table I where 
C is a lower triangular matrix with all diagonal entries being 
non-zero. Note that if we are using the optimal relay set such 
that each node transmits for a non-zero duration, then none 
of the capacity values in the diagonal of C can be zero. This 
is because each such term denotes the capacity value between 
successive relay nodes in the optimal transmission schedule. If 
any of these is zero, it would imply that a successor relay node 
is able to decode the packet without requiring its predecessor 
node to transmit. Then, one can simply remove the predecessor 
node from the optimal relay set, leading to a contradiction. 

Thus, we have: A = C _1 (I — B). Therefore each of 
the terms A^-j+i, Afc_j +2 , . . . , Afc_i, A k is linear in the 
variables ft, ft, ... , (3 k -j. Using this, the objective in © can 
be expressed as a linear function of these variables. Let this 
be denoted by /(ft, ft, . . . , ftc-j). Also we know that under 
the optimal solution, A k _j+i > 0, . . . , A& > 0. Thus, we can 
remove the last set of constraints from (|7]) without affecting 



the optimal solution (using Lemma [T]). Thus, © becomes: 
Minimize: /(ft, ft, ... , f3 k -j) 

k-j 

Subject to: I k - j+1 (t k -j) + ^ f3id,k-j+i = I ma x 

i=0 

ft >0 Vie{0,l,2,...,fc-j} (8) 

Similar to the stage (k — 1) case, this optimization problem is 
linear in ft with a single linear equality constraint and thus the 
solution is of the form where all except one ft are zero. Since 
<x fc J = A ^ , we have that in the optimal solution, exactly 

one of the fractions a ~ J , a r ~ J , . . . , ct k Zj is equal to 1 and 
rest must be 0. This implies that only one node transmits in 
this stage. Further, this node must be the relay node k — j 
that decoded the packet at the beginning of this stage. Else, 
node k — j never transmits. This is because by definition of 
stage (k — j), node k — j does not have the packet before the 
beginning of stage (k — j) and hence cannot transmit before 
stage (k — j). By induction hypothesis, only nodes k — j + 
1, k — j + 2, . . . , k transmit when stage (k — j) ends. Thus, if 
node k — j is not the node chosen for stage (k — j), it never 
transmits, contradicting the fact that it is part of the optimal 
set. This proves the Theorem. ■ 

D. Exact Solution for a Line Topology 

In this section, we present the optimal solution for a special 
case of line topologies. Specifically, all nodes are located on a 
line as shown in Fig. 01 and no two nodes are co-located. We 
assume that each node transmits at the same PSD P. Further, 
the transmission capacity dj between any two nodes i and j 
depends only on the distance dij between the two nodes and is 
a monotonically decreasing function of dij. For example, we 
may have that dj = log 2 (l + %^~) where P is the PSD and 

hij = -4- where a > 2 is the path loss coefficient. Under these 

v 

assumptions, we can determine the optimal cooperating set for 
the problem of routing with mutual information accumulation 
as follows. 

Lemma 2: The optimal cooperating set for the line topology 
as described above is given by the set of all relay nodes located 
between the source and the destination. 

Proof: (Lemma\2§: Consider the line network as shown in 
Fig. 01 We first show that the optimal cooperating set cannot 
contain any relay node that lies to the left of the source. 
Suppose the optimal set contains one or more such nodes. 
Then, we can replace all transmissions by these nodes with a 
source transmission and get a smaller delay. This is because 
the source has a strictly higher transmission capacity to all 
nodes to its right than each of these nodes. 
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n + 1 +i+i_ 



Fig. 4. A line topology. 



Next, we show that the optimal cooperating set must contain 
all the nodes that are located between s and d. We know 
that s is the first node to transmit. The first relay node that 
decodes the packet is node 1, since link s — 1 has the smallest 
distance and therefore the highest transmission capacity among 
all links from s to nodes to the right of s. From Theorem Q] 
we know that once node 1 has decoded the packet, it should 
start transmitting if it is part of the optimal set. Else, it never 
transmits and the source continues to transmit until another 
node can decode the packet. Suppose that the optimal set does 
not contain node 1. Then, we can get a smaller delay by having 
node 1 transmit instead of s once it has decoded the packet. 
This is because node 1 has a strictly higher transmission 
capacity to all nodes to its right than s. Thus, we have that 
the optimal set must contain node 1. 

The above argument can now be applied to each of the 
nodes 2, 3, . . . , n as in Fig. [U This proves the Lemma. ■ 

To get an idea of the reduction in delay achieved by us- 
ing mutual information accumulation over traditional routing, 
consider the line topology above with n nodes placed between 
s and d at equal distance such that d^+i = 1 for all i. Also, 
suppose the transmission capacity on link i — j is given by 
Cij = 3£ where 7 > is a constant. Then the capacity of link 

ij 

s — 1 is 7P, the capacity of link s — 2 is ^ , the capacity of 
link s — 3 is and so on. Define OAjP. Then, the minimum 
delay for routing with mutual information accumulation is 
given by Y%=o A* where: 



C s i 



Imax a Imax 
— 7T - 7 ^1 — 



A C S 



A 9 



(n+l-i) 2 



+ 1 



For simplicity, let us ignore the contribution of nodes that are 
more than 3 units away from a receiver. Then, we have: 

(n + l)I max - f £r=o A, - § Et"o 2 A, 



n 



(n + l)I n 



0(1 + \ + h) 



< 



(n + l)I max + § A + f 2A I max ( n + 1 



od + \ + \) 



1 + 1 + 1 
J- t 4 -r 9 



where we used the fact that A n ,A n _i < A . The mini- 
mum delay for traditional routing is simply (n + 1)A = 
(n + l)^ 22 ^. Thus, for this network, the delay under mutual 
information accumulation is smaller than that under traditional 



routing at least by a factor , 1N . ... ■ : - t , 

& ^ (n+l)(l+i + i) 

73% for large n. 



that approaches 



36 
49 



IV. Minimum Energy Routing with Delay 
Constraint 

Next, we consider the problem of minimizing the sum total 
energy to transmit a packet from the source to destination 
using mutual information accumulation subject to a given 
delay constraint D max . This problem is more challenging 
than problem (QJ since in addition to optimizing over the 
cooperating relay set and the transmission order of nodes in 
that set, it also involves determining the PSD values to be used 
for each node. Further, a cooperating relay node may need to 
transmit at different PSD levels during different stages of the 
transmission schedule. 

A. Problem Formulation 

Consider a transmission strategy (similar to the one dis- 
cussed in Section IIII-Ab that is described by a cooperating 
relay set 1Z of size \1Z\ = k and a decoding order O. Let 
the terms Aj and be defined in a similar fashion. Also, 
let Pij denote the PSD at which node i transmits in stage j. 
Then for any transmission strategy Q that uses the subset of 
relay nodes 7Z with an ordering O, the minimum sum total 
energy to transmit a packet from source to destination subject 
to the delay constraint P>max is given by the solution to the 
following optimization problem: 



Minimize: ^ ^ Aij Pij 



j=0 i=Q 



Subject to: ^ Aj < 



3=0 



-1 m-l 



AijCim(Pij) > Vra G {!,...,& + 1} 



-0 j=0 



Y^Aij < Aj Vj e {0, 1, 2, ... , k} 

i=0 

Aij,Pij > G {0, 1,2,..., k}J e {0, 1, . . . , k} 
Aij = 0, P^ = 0\/i> j 

Aj > Vj G {0, 1, 2, . . . , A:} (9) 

where the first constraint represents requirement that the total 
delay must not exceed D max . The second constraint captures 
the requirement that node m in the ordering must accumulate 
at least Imax amount of mutual information by the end of 
stage m — 1 using all transmissions in all stages up to stage 
m — 1. Note that in the second constraint, Ci m (Pij) denotes 
the transmission capacity of link i — m in stage j and it is a 
function of P^, the PSD of node i in stage j. Also note that 
© is not a linear program in general, since the Ci m (Pij^ may 
be non-linear in P^ . 

B. Characterizing the Optimal Solution of ([9|) 

Let lZ opt denote the subset of relay nodes that take part in 
the routing process in the optimal solution. Let k = \TZ op t\ be 
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the size of this set. Also, let O opt be the optimal ordering. Note 
that, by definition, each node in !Z opt transmits for a non-zero 
duration (else, we can remove it from the set without affecting 
the sum total energy). Finally, let P° 3 pt denote the optimal PSD 
used by node i in stage j. Then, similar to TheoremQ] we have 
the following: 

Theorem 2: Under the optimal solution to the minimum 
energy routing with delay constraint problem (|9]), in each stage 
j, it is optimal for only one node to transmit, and that node 
is node j. 

Proof: (Theorem 0): Note that given any fixed set of 
power levels Pij , the problem ^ becomes linear in the other 
variables and is similar to the problem (OD). Thus, similar 
arguments as in the proof of Theorem Q] can be applied to 
establish this property and we omit the details for brevity. ■ 

Although Theorem [2] simplifies the optimization problem 
©, it does not yield a greedy transmission strategy applied 
over all subsets (similar to the one in Section IIII-CI) for 
computing the optimal solution. This is because the transmis- 
sion order generated by the greedy strategy depends on the 
power levels used. For general non-linear rate-power functions, 
different power levels can give rise to different decoding orders 
for the same relay set under the greedy strategy (see Appendix 
B for an example). Thus, solving © may involve searching 
over all possible orderings of all possible subsets. However, for 
the special, yet important case of linear rate-power functions, 
this problem can be simplified considerably. A linear rate- 
power function is a good approximation for the low SNR 
regime. For example, in sensor networks where bandwidth is 
plentiful and power levels are small, it is reasonable to assume 
that the nodes operate in the low SNR regime. In the following, 
we will assume that the transmission capacity Qj(P^) on link 
i — j is given by Qj(P^) = jPihij (in units of bits/sec/Hz) 
where 7 is a constant and Pi is the PSD of node i. Then, we 
have the following: 

Theorem 3: For linear rate-power functions, the decoding 
order of nodes in the optimal set 7Z op t under the greedy 
transmission strategy is the same for all non-zero power 
allocations. Further, the sum total power required to transmit 
a packet from the source to the destination is the same for all 
non-zero power allocations. 

Proof: (Theorem \3§: We prove by induction. Consider 
any non-zero power allocation used by the nodes in 1Z opt . 
The source is the first node to transmit. Let it be indexed by 
0. Also, suppose the source uses PSD Po > 0. Under the 
greedy transmission strategy, the source continues to transmit 
until any node can decode the packet. This node is the one 

that minimizes A = cvp%) = r^S over a11 i e T^opu 
which is the time to decode the packet. Clearly, this node is 
the same for all Po > 0- Let it be indexed by 1. Also, we have 
that: 

A I max a j-y I max; 

tj /, — ^ A oP3 = — r — 
7^b hoi 7^oi 

which shows that the total power used in stage is independent 
of P . Next, let the PSD of node 1 be Pi. Then, in stage 1 
under the greedy transmission strategy, node 1 transmits until 
any node that does not have the packet yet can decode it. This 



node is the one that minimizes over all i G 7Z op t \ {!}• 

Imax — AoCoj(Po) _ Imax ~ A^Pphpi _ ^maxi^ ~ J^) 

Cu jPihu jPihu 

Clearly, this node is the same for all Pi > 0. Let it be indexed 
by 2. Also, we have that: 

T (1 _ h02\ J (1 ^02_^ 

jPih 12 7/112 

which shows that the total power used in stage 1 is independent 
of P and Pi. 

Now suppose this holds for all stages {0, 1, 2, . . . , j — 1} 
where j — 1 < k. We show that it also holds for stage j. Let 
the PSD of node j be Pj. Under the greedy strategy, node j 
continues to transmit in stage j until any node that does not 
have the packet yet can decode it. This node is the one that 
minimizes over all i G 7Z op t \ {1, 2, . . . , j}: 

T - V J 1 A C (P ) T -7V J_1 A P h 

± max / j Tri —{) L - x m K - J mi\- L m) ± max l / j rn —{) L - X m ± m' l mi 

Cji(Pj) = jPjhji 

From the induction hypothesis, we know that each of the terms 
A m P m for all m G {0, 1, ... , j — 1} is independent of the 
power levels P m . Thus, we have that the node that minimizes 
the expression above is the same for all Pj > 0. Further, the 
total power used in stage j is given by 

Ar) I max 7 y^ rr? — q ^mPmhmi 

11 = ^~ 

which is independent of Po , Pi , . . . , P 3 ? . This proves the The- 
orem. ■ 

C. A Greedy Algorithm 

Theorem [2] suggests a simple method for computing the 
optimal solution to © when the rate-power function is linear. 
Specifically, we start by setting all PSD levels to the same 
value, say some P > 0. From Theorem [3 we know that 
the sum total power required to transmit a packet from the 
source to the destination is the same for all non-zero power 
allocations. Then, solving © is equivalent to solving the 
minimum delay problem (Q]) with given power levels, except 
the delay constraint. This can be done using the greedy strategy 
described in Section UlI-CI If the solution obtained satisfies the 
delay constraint D max , then we are done. Else, suppose we 
get a delay D > D max . Then, we can scale up the power level 
P by a factor n D and scale down the duration of each stage 
Aj by the same factor. This ensures that the delay constraint 
is met while the sum total power used remains the same. 

V. Minimum Delay Broadcast 

Next, we consider the problem of minimum delay broadcast 
for the network model described in Section HH In this problem, 
starting with the source node, the goal is to deliver the packet 
to all nodes in the network in minimum time with mutual 
information accumulation. We assume that there are n nodes in 
the network other than the source. Similar problems have been 
considered in [22] -[24] which focus on energy accumulation 
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and where the goal is to broadcast the packet to all nodes 
using minimum sum total energy. 

A. Timeslot and Transmission Structure 

For the minimum delay broadcast problem, the transmission 
strategy and resulting time timeslot structure under a general 
policy is similar to the one discussed for the minimum delay 
routing problem in Section IIII-A1 Specifically, let O be the 
ordering of the n nodes that represents the sequence in 
which they successfully decode the packet under any strategy. 
Without loss of generality, let the nodes in the ordering O be 
indexed as 1, 2, 3, . . . , n. Also, let the source s be indexed 
as 0. Initially, only the source has the packet. Let to be 
the time when it starts its transmission and let 1 1 , t2 , • • • ,t n 
denote the times when nodes 1, 2, . . . , n in the ordering O 
accumulate enough mutual information to decode the packet. 
We say that the transmission occurs over n stages, where stage 
j, j G {0, 1, 2, . . . , n — 1} represents the interval [tj, tj+i]. 
Note that in any stage j, the first j nodes in the ordering O 
and the source have the fully decoded packet. Thus, any subset 
of these nodes (including potentially all of them) may transmit 
during this stage. For each j, define the duration of stage j as 
Aj = tj+i—tj. Also, let Aij denote the transmission duration 
for node i in stage j. As before, we have that A^ = if i > j, 
else A^ > 0. The total time to deliver the packet to all the n 
nodes is given by T tot =t n -t = Sj=o A?'- 

B. Problem Formulation 

For any transmission strategy that results in the decoding 
order 0, the minimum delay for broadcast is given by the 
solution to the following optimization problem: 

n-1 

Minimize: T tot = ^ Aj 

3=0 

Subject to: 

m — 1 m — 1 

y ] y ] AijCi m > Imax Vm G {1, 2, . . . , 71 } 

i=0 j=0 
3 

J2^ij < Aj Vj G {0, 1, 2, . . . , n — 1} 

A^ > Vi G {0, 1, 2, . . . , n — 1}, j G {0, 1, 2, . . . , n — 1} 
A^ = Vi > j 

Aj > Vj G {0, 1, 2, . . . , n — 1} (10) 

This is similar to (OD) except that the set 7Z contains all n 
nodes and that d is not necessarily the last node to decode the 
packet. As in (Q]), the first constraint captures the requirement 
that node m in the decoding order O must accumulate at 
least Imax amount of mutual information by the end of stage 
m — 1 using transmissions in all stages up to stage m — 1. 
The second constraint means that in every stage j, the total 
transmission time for all nodes that have the fully decoded 
packet in that stage cannot exceed the length of that stage. 
Similar to ©, the above problem is a linear program and thus 
can be solved efficiently for a given ordering O. This is the 
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Fig. 5. Optimal timeslot and transmission structure for minimum delay 
broadcast. In each stage, at most one node from the set of nodes that have 
the full packet transmits. 

approach taken in [[23l (with energy accumulation instead of 
mutual information accumulation, and with the objective of 
minimizing total energy for broadcast instead of delay) that 
proposes solving such a linear program for every possible 
ordering of the n nodes, resulting in n! linear programs. In 
the next section, we show that the above computation can 
be simplified by making use of a structural property of the 
optimal solution that is similar to the results of Theorems \T\ 
and |2 This results in a greedy algorithm that does not require 
solving such linear programs to compute the optimal solution. 

C. Characterizing the Optimal Solution of (ITU 

Let O opt be the decoding order under the optimal solu- 
tion. Suppose the the nodes in the ordering are labeled as 
{0, 1, 2, . . . , n — 1, n} with being the source node. Then, 
similar to Theorems [T] and [2 we have the following: 

Theorem 4: Under the optimal solution to the minimum 
delay broadcast problem (fT0|h in each stage j, it is optimal 
for at most one node to transmit. 

While Theorem [4] states that under the optimal solution, at 
most one node transmits in each stage j, unlike Theorems Q] 
and [2 it does not say that this node must be node j. In fact, 
this node could be any one of the nodes that have the full 
packet. Specifically, let Tj be the node that transmits in stage j. 
Then, using TheoremlU we have that Tj G {0, 1, 2, . . . , j}. The 
optimal timeslot structure for the minimum delay broadcast 
problem is shown in Fig. [5] Note that unlike the optimal 
timeslot structure for the minimum delay routing problem (Fig. 
0, here it is possible for a node to transmit more than once 
over the course of the broadcast. This property does not reduce 
the complexity of finding the optimal solution from 0(n\) 
linear programs to 0(2 n ) runs of a greedy strategy. However, 
as we show in Section IV-D1 it still leads to a greedy algorithm 
for finding the optimal solution that does not require solving 
n! linear programs like in [23]. 

Proof: (Theorem 0): The proof is similar to the proof 
of Theorem [T] and therefore, we only provide a sketch here, 
highlighting the main differences. 

Note that Theorem [4] trivially holds in stage (since only 
the source has the full packet in this stage). Next, similar to 
Theorem [T] in the last stage (i.e., stage (n— 1)), only the node 
with the best link (in terms of transmission capacity) to node 
n in the ordering O opt should transmit in order to minimize 
the total delay. Let this node be labeled r n _i. However, unlike 
Theorem [T] we cannot claim that this node must be node n — 1 
in the ordering O opt . This is because while r n _i has the best 
link to n, it does not necessarily have the best links to all those 
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nodes in the decoding order O opt that come after r n _i. Thus 
r n _i could be any one of {0, 1, 2, . . . , n— 1}. This shows that 
under the optimal solution, in the last stage (n — 1), only one 
node r n _i transmits. Using induction, we can show that in 
every prior stage (n — j) where 1 < j < n, at most one node 
needs to transmit. 

Consider the (n — 2) th stage. At time t n -2, all nodes 
except n — 1 and n have decoded the packet. Let the mutual 
information state at nodes n — 1 and n at time t n -2 be 
I n -i{t n -2) and I n (t n -2) respectively. Also, suppose in the 
(n — 2) th stage, relay nodes 1, 2, . . . , n — 2 and the source 
transmit a fraction a™ -2 , a^ -2 , • • • , <^n-2 an d <^o~ 2 °f me 
total duration of stage (n — 2), i.e., A n _ 2 , respectively. Note 
that these fractions must add to 1 since it is suboptimal to have 
any idle time (where no one is transmitting). Then, the optimal 
solution must solve the following optimization problem: 

Minimize: A n _ 2 + A n _i 
Subject to: 

n-2 

,n — 1 — I max 

i=0 
n-2 

In(tn-2) + A n _ 2 ^ a™ 2 Q n + A n _iC rri 

_i,n — I max 

i=0 

r\ ^ „n — 2 „n — 2 „n — 2 ^ i 

n-2 

i=0 

A n _ 2 >0,A n _x >0 (11) 

Here, the first constraint states that node n— 1 must accumulate 
at least i ma x bits of mutual information by the end of stage 
(n — 2). The second constraint states that node n must 
accumulate at least I m ax bits of mutual information by the 
end of stage (n — 1). Note that in the last term of the left 
hand side of the second constraint, we have used the fact that 
only node r n _i transmits during stage (n — 1). 

It is easy to see that under the optimal solution, the first 
and second constraints must be met with equality. This simply 
follows from the definition of the beginning of any stage j 
as the time when node j has just decoded the packet. Next, 
let ^ = A n _ 2 <~ 2 for all i e {0,1,2,..., n - 2}. Since 

Er=o 2 <~ 2 = 1. we have that 5Xo 2 A = A n-2 and CD is 
equivalent to: 

n-2 

Minimize: ^ ft + A n _i 

i=0 

n-2 

Subject to: l n -\ ,n — 1 — Imax 

i=0 
n-2 

l-ni^n — 2) + ^ ^ fiiCin + A n _xC rri _ 1?n — Imax 
i=0 

A n _i >0,ft >0 Vie {0,1, 2,..., n-2} 

(12) 

Note that problems (ITTb and (IT2l) are equivalent because 
we can transform ([T2t to the original problem by using the 



relations A n _ 2 = Ya=o A and = In me 

degenerate case where A n _2 = 0, we have that no node 
transmits in stage (n — 2), so that Theorem H] holds. Note 
that this is different from the scenario in the minimum delay 
routing problem where, by definition, the length of a stage in 
the optimal solution is strictly positive. Here, it is possible for 
a stage to have zero length. This happens when two or mode 
nodes can accumulate enough mutual information and decode 
the packet simultaneously. 

Using similar arguments as in Theorem [T] it can be shown 
that when A n _ 2 > 0, then in the optimal solution exactly 
one of the fractions o^ -2 , a \~ 2 -> • • • •> a n-2 ^ s e( l u al to 1 and 
rest must be 0. This implies that only one node transmits in 
this stage. Combining with the case where A n _2 = 0, we 
have that at most one node transmits in stage (n — 2). We 
label this node as r n _2. Note that r n _2 could be any one of 
{0,l,2,...,n-2}. 

Using induction, it can be shown that in every stage (n — 
j),2 < j < n, at most one node labeled r n -j transmits. 
Further, r n -j could be any one of {0,1, 2, — j}. This 
proves the Theorem. ■ 

D. A Greedy Algorithm 

Theorem [4] can be used to construct the following simple 
algorithm for computing the optimal solution to dTUb . The 
algorithm runs in n stages. In each stage j, < j < n — 1, 
the algorithm performs (j + 1)! separate runs. Each run 
corresponds to selecting one transmitter from the set of nodes 
with the full packet at the start of that run and having that 
node transmit until a new node decodes the packet. Thus, the 
number of nodes with the full packet increases by one at the 
end of each run. 

Let Sij denote the set of nodes that have the full packet at 
the end of the i th run of stage j. We have that the size of Sij 
is equal to \\Sij || = j + 2 for all i. Further, there are (j + 1)! 
distinct such sets at the end of stage j so that the algorithm 
performs (j + 2) x (j + 1)! = (j + 2)! runs in the next stage 

(i + i). 

To see this, note that we start at stage with only the source 
having the full packet and perform only one run. At the end 
of this stage, suppose node 1 has the packet. Then we have 
<?io = {<§, 1}. In next stage (i.e., stage 1), we perform 2! = 2 
separate runs as follows. In the first run, s is chosen as the 
transmitter for stage 1 and continues to transmit until another 
node (say x) gets the packet. This yields <Sn = {5, l,x}. In 
the second run, 1 is chosen as the transmitter for stage 1 and 
continues to transmit until another node (say y) gets the packet. 
This yields S21 = {s, 1,2/}. Thus, at the end of stage 1, we 
have 2! = 2 sets, <Sn and S21, of size 1 + 2 = 3 each. This 
procedure is repeated in stage 2 resulting in 3 runs starting 
with <Sn and 3 runs starting with 52 1. Thus, in stage 2, the 
algorithm performs (1 + 2)! = 6 runs and yields 3! = 6 sets, 
5i2, 522, • • • , 562, each of size 2 + 2 = 4, at the end of stage 2. 
In the same way, it can be shown that in stage j, the algorithm 
starts with j\ sets of size j + 1 each, performs (j + 1)! runs 
and results in (j + 1)! sets, each of size j + 2. 

The algorithm terminates after stage (n — 1) where it 
performs n\ runs and when all nodes decode the packet. 
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The optimal solution is obtained by picking the sequence of 
transmitting nodes that yields the minimum delay. 

It can be seen that the complexity of this algorithm is 0(n\) 
Essentially, this algorithm performs an exhaustive search over 
all possible feasible decoding orderings. This corresponds to 
searching over all possible values of Tj G {s,l,2,...,j} in 
every stage j (See Fig. 0. However, unlike (23), it does not 
require solving any linear programs. 

VI. Distributed Heuristics and Simulations 

The greedy algorithm presented in Section IIII-CI to com- 
pute the optimal solution to problem (QJ has an exponential 
computational complexity and is centralized. In this section, 
we present two simple heuristics that can be implemented in 
polynomial time and in a distributed fashion. We compare the 
performance of these heuristics with the optimal solution on 
general network topologies. We also show the performance of 
the traditional minimum delay route that does not use mutual 
information accumulation. 

Heuristic 1 : Here, first the traditional minimum delay route 
is computed using, say, Dijkstra's shortest path algorithm on 
the weighted graph (where the weight Wij of link i — j is 
defined as the time required to deliver a packet from i to 
j, i.e., = ^g^). Let M denote the set of relay nodes 
that form this minimum delay shortest path. Then the greedy 
algorithm as described in Section UlI-CI is applied on the set of 
nodes in M. Note that we are not searching over all subsets 
of mE Thus, the complexity of this heuristic is same as that 
of any shortest path algorithm, i.e., 0(\M\ 2 ). 

Heuristic 2: Here, we start with M as the initial cooperative 
set. Then, while applying the greedy algorithm of Section 
IIII-CI if other nodes that are not in M happen to decode the 
packet before the next node (where the next node is defined 
as that node in M that would decode the packet if the current 
transmitter continued its transmission), then these nodes are 
added to the cooperative set if they have a better channel 
to the next node than the current transmitter. The intuition 
behind this heuristic is that while M is expected to be a 
good cooperative set, this allows the algorithm to explore more 
nodes and potentially improve over Heuristic 1. 

In our simulations, we consider a network of a source, 
destination, and n relay nodes located in a 10 x 10 area. The 
location of source (1.0, 2.0) and destination (8.0, 8.0) is fixed 
while the locations of the other nodes are chosen uniformly at 
random. The link gain between any two nodes i and j is chosen 
from a Rayleigh distribution with mean 1. For simplicity, all 
nodes have the same PSD. The total bandwidth W and packet 
size Imax are both normalized to one unit. The transmission 
capacity of link i — j is assumed to be Cij = log 2 ^1 + ^ 
where dij is the distance between nodes i and j and a is the 
path loss exponent. We choose a = 3 for all simulations. 

In the first simulation, n = 25 and the network topol- 
ogy is fixed as shown in Fig. [6) We then compute the 
traditional minimum delay route and the optimal solution 

3 It may be possible to get further gains by searching over all subsets of 
M., but the worst case complexity of doing so would again be exponential. 
Our goal here is to develop polynomial time algorithms. 
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Fig. 6. A 25 node network where the routes for traditional minimum delay, 
Heuristics 1 and 2, and optimal mutual information accumulation are shown. 

for routing with mutual information accumulation using the 
greedy algorithm of Section IIII-CI We also implement Heuris- 
tics 1 and 2 on this network. Fig. [6] shows the results. It 
is seen that the traditional minimum delay route is given 
by [s, 1, 9, 22, 19, 23, 25, 18, 10, d] while the optimal mutual 
information accumulation route (according to the decoding or- 
der) is given by [s, 1, 9, 22, 19, 16, 24, 17, 12, 23, 25, 18, 10, d). 
The decoding order of nodes under Heuristic 1 is same as that 
under the traditional minimum delay route while that under 
Heuristic 2 is given by [s, 1, 9, 22, 19, 16, 23, 25, 18, 10, d] . The 
total delay under traditional minimum delay routing, Heuristic 
1, Heuristic 2, and optimal mutual information accumulation 
routing was found to be 29.84, 23.73, 22.99 and 22.19 seconds 
respectively. 

This example demonstrates that the optimal route under 
mutual information accumulation can be quite different from 
the traditional minimum delay path. It is also interesting to 
note that the set of nodes in M is a subset of the cooperative 
relay set in this example. However, this does not hold in 
general. We also note that the delay under both Heuristics 
1 and 2 is close to the optimal value. Finally, while Heuristic 
1 only uses the nodes in M, Heuristic 2 explores more and 
ends up using node 16 as well. 

In the second simulation, we choose n = 20. The source and 
destination locations are fixed as before but the locations of the 
relay nodes are varied randomly over 100 instances. For each 
topology instance, we compute the minimum delay obtained 
by these 4 algorithms. In Fig. [71 we plot the cumulative 
distribution function (CDF) of the ratio of the minimum delay 
under the two heuristics and the traditional shortest path to 
the minimum delay under the optimal mutual information 
accumulation solution. From this, it can be seen that both 
Heuristic 1 and 2 perform quite well over general network 
topologies. In fact, they are able to achieve the optimal 
performance 40% and 60% of the time respectively. Further, 
they are within 10% of the optimal at least 90% of the time 
and within 15% of the optimal at least 98% of the time. 
Also, Heuristic 2 is seen to outperform Heuristic 1 in general. 
Finally, the average delay in routing with mutual information 
accumulation was found to be 77% of the average delay of 
traditional shortest path routing (where the average is taken 
over the 100 random topologies). It is interesting to note that 
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Fig. 7. The CDF of the ratio of the minimum delay under the two heuristics 
and the traditional shortest path to the minimum delay under the optimal 
mutual information accumulation solution. 

once the optimal relay set is computed, the only overhead 
of a mutual information accumulation based scheme over 
traditional shortest path is in the decoding and re-encoding 
operations. In particular, such schemes only require a 1-bit 
feedback from a receiver when it has successfully decoded 
the packet and do not require sophisticated synchronization, 
coordination, or extensive channel state information feedback. 

VII. Conclusions 

In this work, we considered three problems involving opti- 
mal routing and scheduling over a multi-hop wireless network 
using mutual information accumulation. We formulated the 
general problems as combinatorial optimization problems and 
then made use of several structural properties to simplify 
their solutions and derive optimal greedy algorithms. A key 
feature of these algorithms is that unlike prior works on these 
problems, they do not require solving any linear programs to 
compute the optimal solution. While these greedy algorithms 
still have exponential complexity, they are significantly simpler 
than prior schemes and allow us to compute the optimal 
solution as a benchmark. We also proposed two simple and 
practical heuristics that exhibit very good performance when 
compared to the optimal solution. 

Appendix A 
Proof of LemmaCQ 

We argue by contradiction. Suppose an optimal solution 
to © without the constraint x > is given by x' ^ x* . 
Then, we have that c T x' < c T x*. Further, x' satisfies all 
the constraints we did not remove, but must violate at least 
one of the constraints that we removed. Thus, we have that 
Ax' = b and x' ^ 0. Now let x" be a convex combi- 
nation of x* and x' , i.e., x" = Ox* + (1 — 9)x' where 
< < 1. We have that c T x" = 6c T x* + (1 - 6)c T x'. 
Since c T x' < 6c T x* + (1 - 6)c T x' < c T x*, we have that 
c T x' < c T x" < c T x*. 

Since x* satisfies the strict inequality constraint x > in 
all entries, there must be a ball about x* that still satisfies the 
constraint x > 0. Further, the line segment joining x* and x' 
intersects this ball. Let us choose 6 such that x" is this point 
of intersection. Then x" still satisfies the constraint x" > 0. 
However, c T x" < c T x*, which contradicts the fact that x* 
solves (13 optimally. 




Fig. 8. The 4 node example network used in Appendix B. 

Appendix B 
A Simple Example 

Here, we show an example where different power levels 
can give rise to different decoding orders for the same relay 
set under the greedy transmission strategy when the rate-power 
curve is non-linear. Consider the 4 node network in Fig. [8] We 
assume the rate-power curves on all links except link s — 3 are 
linear. Specifically, dj(Pi) = h^Pi for all ij ^ s3. However, 
the rate-power curve on link s — 3 is logarithmic and is given 
by C s3 (P s ) = \og 2 (l + h s3 P s ). 

Next, suppose h s \ > h s2j h s3 and hi 2 = hi 3 . Also, let 
Imax = 1. Then, node 1 is the first node to decode the packet 
for all P s > 0. Also, we have Ao = c ^ p ^ = h * p . 

The mutual information state at nodes 2 and 3 at the end 
of stage is given by I 2 (h) = A C s2 (P s ) = A h s2 P s 
and J 3 (ti) = A C s3 (Ps) = A log 2 (l + h s3 P s ) respectively. 
Under the greedy transmission strategy, after stage 0, node 1 
will continue to transmit until any of nodes 2 or 3 decodes the 
packet. Suppose node 1 uses transmit power Pi > 0. Then, 
the time for node 2 to decode if node 1 continues to transmit 
is given by: 

, _ Imax — h(tl) _ 1 — Aph s 2P s _ 1 ~ jf~[ 

2 " C 12 (Pi) ~ h 12 Pi ~ h 12 Pi 

Similarly, the time for node 3 to decode if node 1 continues 
to transmit is given by: 

. Imax ~ h(tl) 1 ~ Aq log 2 (l + h s3 P s ) 

3 " Ci 3 (Pi) " h 13 Pi 

, _ log 2 (l+/i 53 P s ) 
h sl P s 

hisPi 

Since hi 2 = his, from the above we have that 5 2 > 5% if 
h s2 P s < log 2 (l + h s3 P s ) and 5 2 < S 3 if h s2 P s > log 2 (l + 
h s3 P s ). Let h s2 = 0.05, h s3 = 0.1. Then, for P s = 1, we get 
5 2 > S 3 since 0.05 < log 2 (l.l). However, for P s = 100, we 
have that 5 2 < 5 3 since 5 > log 2 (ll). This shows that different 
power levels can give rise to different decoding orders for the 
same relay set under the greedy transmission strategy when 
the rate-power curve is non-linear. 
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